Evaluating Content Extraction from Audio Sources

نویسندگان

  • Lynette Hirschman
  • John Burger
  • David Palmer
  • Patricia Robinson
چکیده

This paper discusses evaluation of content extraction from audio sources. The most straightforward approach is to adapt existing methods for written sources to handle audio input. A transcription then becomes the representation of the audio source in written form; it must capture the word stream, but also other information that aids in decoding the overall structure and content of the audio source, e.g., music, speaker changes, and speech repairs. The transcription must also support content annotation superimposed on the underlyin speech transcription. When automated speech recognition is used to generate the transcription, there is the additional problem of evaluating content extraction from a noisy transcription. In addition, audio sources differ from their written counterparts in genre and therefore in structure, vocabulary, and even in how names are used. If the audio includes spontaneous conversational speech, as opposed to planned speech, these differences become still more pronounced. We discuss how these differences affect the adaptation of text-based extraction evaluation to audio input. In addition, we describe two new content extraction evaluations that have been designed for use with both audio and written materials.s

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Audio Feature Extraction and Analysis for Scene Segmentation and Classification

Understanding of the scene content of a video sequence is very important for content-based indexing and retrieval of multimedia databases. Research in this area in the past several years has focused on the use of speech recognition and image analysis techniques. As a complimentary effort to the prior work, we have focused on using the associated audio information (mainly the nonspeech portion) ...

متن کامل

Audio-Visual Content Analysis for Content-Based Video Indexing

An audio-visual content analysis method is presented, which analyzes both auditory and visual information sources and accounts for their inter-relations and coincidence to extract high-level semantic information. Both shotbased and object-based access to the visual information is employed. Due to the temporal nature of video, time has to be accounted for. Thus, time-constrained video labelling ...

متن کامل

New Compression Techniques for Content - based Retrieval

Due to the proliferation of multimedia information over the internet, users are confronted with large amounts of content from many sources around the world. Techniques that enable users to efficiently search and exchange information are greatly needed. Content-based retrieval system was proposed to automatically annotate and index the multimedia information by their own audio/visual contents in...

متن کامل

Dependent Video Indexing Based on Audio - VisualInteractionS

A content-based video indexing method is presented in this paper that aims at temporally indexing a video sequence according to the actual speaker. This is achieved by the integration of audio and visual information. Audio analysis leads to the extraction of a speaker identity label versus time diagram. Visual analysis includes scene cut detection, face shot determination, mouth region extracti...

متن کامل

Audio Information Extraction from Arbitrary Sound Recordings

Numerous archives of entertainment soundtracks and other recordings such as environmental noise samples have imposed a big data challenge in audio related industries. This necessitates the use of machine audition and retrieval tools to extract semantic information for various applications. Speech recognition, environmental noise classification and music information retrieval tools haven been de...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999